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Abstract. In these notes we formally describe the functionality of Calculating 
Valid Domains from the BDD representing the solution space of valid configu- 
rations. The formalization is largely based on the CLab 1 1 1 configuration frame- 
work. 



1 Introduction 

Interactive configuration problems are special applications of Constraint Satisfaction 
Problems (CSP) where a user is assisted in interactively assigning values to variables by 
a software tool. This software, called a configurator, assists the user by calculating and 
displaying the available, valid choices for each unassigned variable in what are called 
valid domains computations. Application areas include customising physical products 
(such as PC's and cars) and services (such as airplane tickets and insurances). 

Three important features are required of a tool that implements interactive configu- 
ration: it should be complete (all valid configurations should be reachable through user 
interaction), backtrack-free (a user is never forced to change an earlier choice due to 
incompleteness in the logical deductions), and it should provide real-time performance 
(feedback should be fast enough to allow real-time interactions). The requirement of 
obtaining backtrack-freeness while maintaining completeness makes the problem of 
calculating valid domains NP-hard. The real-time performance requirement enforces 
further that runtime calculations are bounded in polynomial time. According to user- 
interface design criteria, for a user to perceive interaction as being real-time, system 
response needs to be within about 250 milliseconds in practice [2]. Therefore, the cur- 
rent approaches that meet all three conditions use off-line precomputation to generate 
an efficient runtime data structure representing the solution space [3 4 5 6 1 . The chal- 
lenge with this data structure is that the solution space is almost always exponentially 
large and it is NP-hard to find. Despite the bad worst-case bounds, it has nevertheless 
turned out in real industrial applications that the data structures can often be kept small 
0511. 



2 Interactive Configuration 

The input model to an interactive configuration problem is a special kind of Constraint 
Satisfaction Problem (CSP) [8 9] where constraints are represented as propositional 
formulas: 



Definition 1. A configuration model C is a triple (X, D, F) where X is a set of vari- 
ables {x$, . . . , x n -i}, D = Do x ... x D n —\ is the Cartesian product of their finite 
domains Dq, . . . , D n -\ and F = {fa, / m _i} is a set of propositional formulae over 
atomic propositions Xi — v, where v E Di, specifying conditions on the values of the 
variables. 

Concretely, every domain can be defined as Di — {0, . . . , \Di\ — 1}. An assign- 
ment of values Vq, . . . , to variables xq, . . . , x n -\ is denoted as an assignment 
p = {(xo, Vo), . . . , (x n -x, v n -i)}. Domain of assignment dora(p) is the set of vari- 
ables which are assigned: dom(p) = {xi | 3v E Dj.(xi,v) E p} and if dom(p) = X 
we refer to p as a total assignment. We say that a total assignment p is valid, if it satisfies 
all the rules which is denoted as p \= F. 

A partial assignment p' , dom(p') C X is valid if there is at least one total assign- 
ment p D p' that is valid p (= F, i.e. if there is at least one way to successfully finish 
the existing configuration process. 

Example 1. Consider specifying a T-shirt by choosing the color (black, white, red, or 
blue), the size (small, medium, or large) and the print ("Men In Black" - MIB or "Save 
The Whales" - STW). There are two rules that we have to observe: if we choose the 
MIB print then the color black has to be chosen as well, and if we choose the small size 
then the STW print (including a big picture of a whale) cannot be selected as the large 
whale does not fit on the small shirt. The configuration problem (X, D, F) of the T- 
shirt example consists of variables X = {xi, Xi, £3} representing color, size and print. 
Variable domains are D\ = {black, white, red, blue}, D2 — {small, medium, large}, 
and L> 3 = {MIB, STW}. The two rules translate to F = {/i,/ 2 }, where /1 = 
(x 3 = MIB) => (xi = black) and f 2 = (x 3 = STW) (x 2 ^ small). There 
are |Z? 1 ||Z? 2 ||£ ) 3| = 24 possible assignments. Eleven of these assignments are valid 
configurations and they form the solution space shown in Fig.Q] 



(black, small, MIB) 
(black, medium, MIB) 
(black, medium, STW) 
(black, large, MIB) 



(black, large, STW) 
(white, medium, STW) 
(white, large, STW) 
(red, medium, STW) 



(red, large, STW) 
(blue, medium, STW) 
(blue, large, STW) 



Fig. 1. Solution space for the T-shirt example 



2.1 User Interaction 

Configurator assists a user interactively to reach a valid product specification, i.e. to 
reach total valid assignment. The key operation in this interaction is that of computing, 
for each unassigned variable Xi E X \dom(p), the valid domain D? C D ; L . The domain 
is valid if it contains those and only those values with which p can be extended to be- 
come a total valid assignment, i.e. D? = {v E Di \ Bp' : p' |= F ApU {{xi, v)} C p'}. 
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The significance of this demand is that it guarantees the user backtrack-free assignment 
to variables as long as he selects values from valid domains. This reduces cognitive 
effort during the interaction and increases usability. 

At each step of the interaction, the configurator reports the valid domains to the 
user, based on the current partial assignment p resulting from his earlier choices. The 
user then picks an unassigned variable Xj £ X \ dom(p) and selects a value from 
the calculated valid domain Vj € Dj. The partial assignment is then extended to p U 
{(xj,Vj)} and a new interaction step is initiated. 

3 BDD Based Configuration 

In 1151101 the interactive configuration was delivered by dividing the computational ef- 
fort into an offline and online phase. First, in the offline phase, the authors compiled a 
BDD representing the solution space of all valid configurations Sol = {p | p \= F}. 
Then, the functionality of calculating valid domains (CVD) was delivered online, by 
efficient algorithms executing during the interaction with a user. The benefit of this ap- 
proach is that the BDD needs to be compiled only once, and can be reused for multiple 
user sessions. The user interaction process is illustrated in Fig. [2] 

InCo(Sol, p) 



1 : while |5o/ p | > 1 

2: compute D p = CVD(Sol, p) 

3 : report D p to the user 

4: the user chooses (xi,v) for some £j0dom(p), v £ Df 

5: p <- pU {(x t ,v)} 

6 : return p 



Fig. 2. Interactive configuration algorithm working on a BDD representation of the so- 
lutions Sol reaches a valid total configuration as an extension of the argument p. 

Important requirement for online user-interaction is the guaranteed real-time expe- 
rience of user-configurator interaction. Therefore, the algorithms that are executing in 
the online phase must be provably efficient in the size of the BDD representation. This 
is what we call the real-time guarantee. As the CVD functionality is NP-hard, and the 
online algorithms are polynomial in the size of generated BDD, there is no hope of pro- 
viding polynomial size guarantees for the worst-case BDD representation. However, it 
suffices that the BDD size is small enough for all the configuration instances occurring 
in practice [10|. 

3.1 Binary Decision Diagrams 

A reduced ordered Binary Decision Diagram (BDD) is a rooted directed acyclic graph 
representing a Boolean function on a set of linearly ordered Boolean variables. It has 
one or two terminal nodes labeled 1 or and a set of variable nodes. Each variable node 
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is associated with a Boolean variable and has two outgoing edges low and high. Given 
an assignment of the variables, the value of the Boolean function is determined by a 
path starting at the root node and recursively following the high edge, if the associated 
variable is true, and the low edge, if the associated variable is false. The function value 
is true, if the label of the reached terminal node is 1 ; otherwise it is false. The graph is 
ordered such that all paths respect the ordering of the variables. 

A BDD is reduced such that no pair of distinct nodes u and v are associated with the 
same variable and low and high successors (Fig.|3^), and no variable node u has iden- 
tical low and high successors (Fig. [3J)). Due to these reductions, the number of nodes 




(a) (b) 

Fig. 3. (a) nodes associated to the same variable with equal low and high successors 
will be converted to a single node, (b) nodes causing redundant tests on a variable are 
eliminated. High and low edges are drawn with solid and dashed lines, respectively 



in a BDD for many functions encountered in practice is often much smaller than the 
number of truth assignments of the function. Another advantage is that the reductions 
make BDDs canonical ifTTl . Large space savings can be obtained by representing a col- 
lection of BDDs in a single multi-rooted graph where the sub-graphs of the BDDs are 
shared. Due to the canonicity, two BDDs are identical if and only if they have the same 
root. Consequently, when using this representation, equivalence checking between two 
BDDs can be done in constant time. In addition, BDDs are easy to manipulate. Any 
Boolean operation on two BDDs can be carried out in time proportional to the product 
of their size. The size of a BDD can depend critically on the variable ordering. To find 
an optimal ordering is a co-NP-complete problem in itself fPTll . but a good heuristic for 
choosing an ordering is to locate dependent variables close to each other in the order- 
ing. For a comprehensive introduction to BDDs and branching programs in general, we 
refer the reader to Bryant's original paper IfTTl and the books II 1 21 1 311 . 

3.2 Compiling the Configuration Model 

Each of the finite domain variables Xi with domain Di = {0, . . . , |Dj| — 1} is encoded 
by ki — \log\Di\~] Boolean variables x l , . . . ,x 1 k ._ 1 . Each j e Di, corresponds to a 
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binary encoding vq . . . _ l denoted as vq . . . Ufc 4 -i = enc(j). Also, every combina- 
tion of Boolean values vo ■ ■ ■ ffci-i represents some integer j < 2 ki — 1, denoted as 
j = dec(vo . . . Vki-i)- Hence, atomic proposition xi = v is encoded as a Boolean ex- 
pression Xq = vq A . . . A x\.._i = Vki-i- In addition, domain constraints are added 
to forbid those assignments to vq . . . Ufc 4 _i which do not translate to a value in Di, i.e. 
where dec(vo . . . i^-i) > |-Dj|. 

Let the solution space So/ over ordered set of variables x < . . . < be repre- 
sented by a Binary Decision Diagram B(V, E, X^, R, var), where V is the set of nodes 
u, E is the set of edges e and X^ = {0, 1, ... , \Xb\ — 1} is an ordered set of variable 
indexes, labelling every non-terminal node u with var(u) < \Xf,\ — 1 and labelling 
the terminal nodes Tq,Ti with index \Xb\. Set of variable indexes X^ is constructed 
by taking the union of Boolean encoding variables (J™=o i x hi ■ ■ ■ i x \ -ll an( ^ ordering 
them in a natural layered way, i.e. < x 1 ? iff i\ < 12 or i\ = 12 and ji < ji. 

Every directed edge e = (u 1,1*2) has a starting vertex iti = 7Ti(e) and ending 
vertex U2 = ^(e). i? denotes the root node of the BDD. 

Example 2. The BDD representing the solution space of the T-shirt example introduced 
in Sect.|2]is shown in Fig.@] In the T-shirt example there are three variables: x\, X2 and 
X3, whose domain sizes are four, three and two, respectively. Each variable is repre- 
sented by a vector of Boolean variables. In the figure the Boolean vector for the vari- 
able Xi with domain Di is (x®,xj, ■ ■ - x 1 ^" 1 ), where k = \\g\Di\~\. For example, in 
the figure, variable X2 which corresponds to the size of the T-shirt is represented by the 
Boolean vector (x%, x\). In the BDD any path from the root node to the terminal node 
1, corresponds to one or more valid configurations. For example, the path from the root 
node to the terminal node 1, with all the variables taking low values represents the valid 
configuration {black, small, MIB). Another path with x\, x\, and x^ taking low values, 
and x\ taking high value represents two valid configurations: (black, medium, MIB) 
and (black, medium, STW), namely. In this path the variable x§ is a don't care variable 
and hence can take both low and high value, which leads to two valid configurations. 
Any path from the root node to the terminal node corresponds to invalid configura- 
tions. 



4 Calculating Valid Domains 

Before showing the algorithms, let us first introduce the appropriate notation. If an 
index k G X^ corresponds to the j + 1-st Boolean variable encoding the finite 
domain variable Xi, we define var\(k) = i and var2(k) = j to be the appropriate 
mappings. Now, given the BDD B(V, E, X&, R, var), Vi denotes the set of all nodes 
u G V that are labelled with a BDD variable encoding the finite domain variable Xi, i.e. 
Vi = {u G V j vari (u) = i}. We think of Vi as defining a layer in the BDD. We define 
Irii to be the set of nodes u G Vi reachable by an edge originating from outside the Vi 
layer, i.e. in, = {u ^ Vi\ 3(u', u) G E. var\(u') < i}. For the root node R, labelled 
with iq — vari(R) we define Ini n — Vi — {R}- 

We assume that in the previous user assignment, a user fixed a value for a finite 
domain variable x — v, x G X, extending the old partial assignment p Q id to the current 
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assignment p = p id U {(x,v)}. For every variable Xi G X, old valid domains are 
denoted as D Pold , i = 0, . . . , n — 1. and the old BDD B Pold is reduced to the restricted 
BDD, B p (V,E,Xt,,var). The CVD functionality is to calculate valid domains Df 
for remaining unassigned variables xt g" dom(p) by extracting values from the newly 
restricted BDD B P (V, E, X b , var). 

To simplify the following discussion, we will analyze the isolated execution of the 
CVD algorithms over a given BDD B(V, E, X},, var). The task is to calculate valid 
domains VDi from the starting domains Di. The user-configurator interaction can be 
modelled as a sequence of these executions over restricted BDDs B p , where the valid 
domains are Df and the starting domains are D Pold . 

The CVD functionality is delivered by executing two algorithms presented in Fig. 
[5] and Fig. [6] The first algorithm is based on the key idea that if there is an edge e = 
(ui, 112) crossing over Vj, i.e. vari(ui) < j < vari(u2) then we can include all the 
values from Dj into a valid domain VDj *— Dj. 

We refer to e as a long edge of length vari(u2) — vari(u\). Note that it skips 
var(u2) — var(u%) Boolean variables, and therefore compactly represents the part of a 
solution space of size 2 var ( u ^~ var( - u ^ . 

For the remaining variables Xi, whose valid domain was not copied by CVD — 
Skipped, we execute CVD(B, a;,) from Fig. [6] There, for each value j in a domain D[ 
we check whether it can be part of the domain Di. The key idea is that if j £ Di then 
there must be u G Vi such that traversing the BDD from u with binary encoding of j 
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CVD - Skipped(B) 

1: for each i = to n— 1 

2 : L[i] <- i + 1 

3 : T <— Topological Sort(B) 

4 : for each fc = to [T| - 1 
5: ui^T[fc], ii <— nari(ui) 
6: for each U2 G Adjacent[u{\ 
7: Z/[ii] <— max{L[ii], uari(u2)} 

8: s^O 

9 : for i = to n - 2 

10 : if i + 1 < L[s] 

11: L[s] <- raa:r{L[s],L[i + 1]} 

12 : else 

13: if s + KL[s] S <- 5" U {s} 

14: s <- i + 1 

15: for each j € S 

1 6 : for z = j to L[j] 

1 7 : <- D 4 



Fig. 5. In lines 1-7 the array is created to record longest edge e = (mi, U2) orig- 
inating from the Vi layer, i.e. L[i] = max{vari(u') \ 3(u,u') E E.vari(u) = i}. 
The execution time is dominated by Topological Sort(B) which can be implemented 
as depth first search in 0(|i?| + |^ |) = 0(|£7|) time. In lines 8-14, the overlapping long 
segments have been merged in 0(n) steps. Finally, in lines 15-17 the valid domains 
have been copied in 0(n) steps. Hence, the total running time is 0(\E\ + n). 



CVD(B,x t ) 

1 : VDi «- {} 

2: for each j = to |A| - 1 
3: for each k — Q to \Irii\ — 1 

4 : u <— 7rii [fc] 

5 : u' *— Traverse(u,j) 
6: if it' 7^ To 

7 : VA «- VA U {j} 

8 : Return 



Fig. 6. Classical CVD algorithm, enc(j) denotes the binary encoding of number j to ki 
values Vq, . . . , Ufc 4 -i. If Traverse(u, j) from Fig. |7]ends in a node different then To, 
then j 6 VA- 
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will lead to a node other than T , because then there is at least one satisfying path to Ti 
allowing X{ = j. 

Traverse(u,j) 



1 




i <— uari(u) 


2 




«o, . . . ,«fc,-i <- enc(j) 


3 




s <— i>ar2(w) 


4 




if Marked[u] — j return T 


5 




Marfced[w] <— j 


6 




while s < fci — 1 


7 




if «ari(u) > i return tt 


8 




if w s = u low(u) 


10 : 


else w <— high(u) 


12 : 


if Marfced[u] = j return To 


13: 


Marfced[w] <— j 


14 : 


s <— iiar2(u) 



Fig. 7. For fixed u e V,i = var\(u), Traverse(u, j) iterates through Vi and returns 
the node in which the traversal ends up. 

When traversing with Traverse(u, j) we mark the already traversed nodes u t with 
j, Marked[u t ] <— j and prevent processing them again in the future j-traversals 
Traverse(u' Namely, if Traverse(u, j) reached T node through u t , then any 
other traversal Traverse(u' , j) reaching u t must as well end up in To. Therefore, for 
every value j e D i7 every node u e Vi is traversed at most once, leading to worst case 
running time complexity of 0( \ Vi \ ■ \Di \ ) . Hence, the total running time for all variables 

isO(£r=oM-IA|). 

The total worst-case running time for the two CVD algorithms is therefore 0(X)"=o 
| A | + | J B|+n) = 0(Er=ol^|-|A|+n). 
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